To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????AB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100000101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f4142
SJIS-WIN 弔?移?衣????移?衣??AB 100100101010001000111111100010001101101000111111100010001101111100111111001111110011111100111111100010001101101000111111100010001101111100111111001111110100000101000010 92a23f88da3f88df3f3f3f3f88da3f88df3f3f4142
EUC-JP 弔?移?衣????移?衣?汶AB 1100010010100100001111111011000011011100001111111011000011100001001111110011111100111111001111111011000011011100001111111011000011100001001111111000111111000110111001010100000101000010 c4a43fb0dc3fb0e13f3f3f3fb0dc3fb0e13f8fc6e54142
UTF-8 弔렟移렊衣쯔렣곌㉢移렊衣쮜汶AB 1110010110111100100101001110101110100000100111111110011110100111101110111110101110100000100010101110100010100001101000111110110010101111100101001110101110100000101000111110101010110011100011001110001110001001101000101110011110100111101110111110101110100000100010101110100010100001101000111110110010101110100111001110011010110001101101100100000101000010 e5bc94eba09fe7a7bbeba08ae8a1a3ecaf94eba0a3eab38ce389a2e7a7bbeba08ae8a1a3ecae9ce6b1b64142
UHC 弔렟移렊衣쯔렣곌㉢移렊衣쮜汶AB 111100001100000010001110101100001110110010111001100011101010000111101011111111011100001011101010100011101011010010110000111010101010100010110011111011001011100110001110101000011110101111111101110000101110100011011010101000010100000101000010 f0c08eb0ecb98ea1ebfdc2ea8eb4b0eaa8b3ecb98ea1ebfdc2e8daa14142

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)