To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????Lh?????????L 001111110011111100111111001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f4c683f3f3f3f3f3f3f3f3f4c
SJIS-WIN 癲〓?意??怨??Lh癲〓?意??怨??L 1110000110011111100000011010110000111111100010001101001100111111001111111000100110000101001111110011111101001100011010001110000110011111100000011010110000111111100010001101001100111111001111111000100110000101001111110011111101001100 e19f81ac3f88d33f3f89853f3f4c68e19f81ac3f88d33f3f89853f3f4c
EUC-JP 癲〓?意??怨??Lh癲〓?意??怨??L 1110001010100001101000101010111000111111101100001101010100111111001111111011000111100101001111110011111101001100011010001110001010100001101000101010111000111111101100001101010100111111001111111011000111100101001111110011111101001100 e2a1a2ae3fb0d53f3fb1e53f3f4c68e2a1a2ae3fb0d53f3fb1e53f3f4c
UTF-8 癲〓쵑意쀯쭓怨뺤졁Lh癲〓쵑意쀯쭓怨뺤졁L 111001111001100110110010111000111000000010010011111011001011010110010001111001101000010010001111111011001000000010101111111011001010110110010011111001101000000010101000111010111011101010100100111011001010000110000001010011000110100011100111100110011011001011100011100000001001001111101100101101011001000111100110100001001000111111101100100000001010111111101100101011011001001111100110100000001010100011101011101110101010010011101100101000011000000101001100 e799b2e38093ecb591e6848fec80afecad93e680a8ebbaa4eca1814c68e799b2e38093ecb591e6848fec80afecad93e680a8ebbaa4eca1814c
UHC 癲〓쵑意쀯쭓怨뺤졁Lh癲〓쵑意쀯쭓怨뺤졁L 111011111010011010100001111010111010110010010011111010111111001010010111111011111010011110001011111010101011001110010101111011001010000010110010010011000110100011101111101001101010000111101011101011001001001111101011111100101001011111101111101001111000101111101010101100111001010111101100101000001011001001001100 efa6a1ebac93ebf297efa78beab395eca0b24c68efa6a1ebac93ebf297efa78beab395eca0b24c
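The rows above can be approximated with a short script. This is a minimal sketch, not the converter's actual implementation: the mapping from the table's charset labels to Python codec names is an assumption (CP932 for SJIS-WIN, CP949 for UHC), and `encode_row` and `encode_table` are hypothetical helper names. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of `3f` in the ISO-8859-1 row, though the page may handle substitution differently.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become "?" (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("Lh").items():
        print(label, bits, hex_string)
```

For the ASCII input "Lh", every charset listed here yields the same two bytes, `4c68`, which matches the `4c68` visible in each row of the table.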

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).