Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character(s) Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 玉??映θ?玉??}v玉??映θ?玉??}vB 10001011110010100011111100111111100010010110011010000011110001100011111110001011110010100011111100111111011111010111011010001011110010100011111100111111100010010110011010000011110001100011111110001011110010100011111100111111011111010111011001000010 8bca3f3f896683c63f8bca3f3f7d768bca3f3f896683c63f8bca3f3f7d7642
EUC-JP 玉??映θ?玉??}v玉??映θ?玉??}vB 10110110110011000011111100111111101100011100011110100110110010000011111110110110110011000011111100111111011111010111011010110110110011000011111100111111101100011100011110100110110010000011111110110110110011000011111100111111011111010111011001000010 b6cc3f3fb1c7a6c83fb6cc3f3f7d76b6cc3f3fb1c7a6c83fb6cc3f3f7d7642
UTF-8 玉앭츪映θ룜玉앭츪}v玉앭츪映θ룜玉앭츪}vB 111001111000111010001001111011001001010110101101111011001011100010101010111001101001100010100000110011101011100011101011101000111001110011100111100011101000100111101100100101011010110111101100101110001010101001111101011101101110011110001110100010011110110010010101101011011110110010111000101010101110011010011000101000001100111010111000111010111010001110011100111001111000111010001001111011001001010110101101111011001011100010101010011111010111011001000010 e78e89ec95adecb8aae698a0ceb8eba39ce78e89ec95adecb8aa7d76e78e89ec95adecb8aae698a0ceb8eba39ce78e89ec95adecb8aa7d7642
UHC 玉앭츪映θ룜玉앭츪}v玉앭츪映θ룜玉앭츪}vB 1110100010101100100111011110010110101110100111111110011110110001101001011110100010001111100110001110100010101100100111011110010110101110100111110111110101110110111010001010110010011101111001011010111010011111111001111011000110100101111010001000111110011000111010001010110010011101111001011010111010011111011111010111011001000010 e8ac9de5ae9fe7b1a5e88f98e8ac9de5ae9f7d76e8ac9de5ae9fe7b1a5e88f98e8ac9de5ae9f7d7642
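
A table like the one above can be approximated with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation: the function name `encode_to_bits` and the mapping from the table's charset labels to Python codec names are assumptions, and unencodable characters are replaced with "?" (0x3f), which appears to be what the ISO-8859-1 row shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    # Render every byte as eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "玉}vB"  # a short excerpt of the input used in the table
    for label, codec in CODECS.items():
        bits, hexa = encode_to_bits(sample, codec)
        print(label, bits, hexa)
```

Each byte always contributes exactly eight binary digits and two hexadecimal digits, so the binary column is four times as long as the hexadecimal column.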

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).