To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??宜??循??瑤??唯??攸??吾??媛 111000011001111100111111001111111000101101011000001111110011111110001111011110100011111100111111111010101010001000111111001111111001011101000010001111110011111110011101101111110011111100111111100011001110000100111111001111111001010101010001 e19f3f3f8b583f3f8f7a3f3feaa23f3f97423f3f9dbf3f3f8ce13f3f9551
EUC-JP 癲??宜??循??瑤??唯??攸??吾??媛 111000101010000100111111001111111011010110111001001111110011111110111101110110110011111100111111111101001010010000111111001111111100110110100011001111110011111111011010110000010011111100111111101110001110001100111111001111111100100110110010 e2a13f3fb5b93f3fbddb3f3ff4a43f3fcda33f3fdac13f3fb8e33f3fc9b2
UTF-8 癲덈챶宜방쨫循녿짎瑤녹쥓唯㎫춯攸꾪뜔吾몃쑜媛 111001111001100110110010111010111000110110001000111011001011000110110110111001011010111010011100111010111011000010101001111011001010100010101011111001011011111010101010111010111000010110111111111011001010011110001110111001111001000110100100111010111000010110111001111011001010010110010011111001011001010010101111111000111000111010101011111011001011011010101111111001101001010010111000111010101011111010101010111010111001110010010100111001011001000010111110111010111010101010000011111011001001000110011100111001011010101010011011 e799b2eb8d88ecb1b6e5ae9cebb0a9eca8abe5beaaeb85bfeca78ee791a4eb85b9eca593e594afe38eabecb6afe694b8eabeaaeb9c94e590beebaa83ec919ce5aa9b
UHC 癲덈챶宜방쨫循녿짎瑤녹쥓唯㎫춯攸꾪뜔吾몃쑜媛 1110111110100110100010001110101110101010100000111110101111110001101110011110011010100100100001011110001011100000100001101110101110100011100110101110100011111101101100111110110010100010100010101110101011100110101001111110011110101101100011001110101011110010100001001110110110001101100101111110011111101110101110001110101110011100101110111110101010110000 efa688ebaa83ebf1b9e6a485e2e086eba39ae8fdb3eca28aeae6a7e7ad8ceaf284ed8d97e7eeb8eb9cbbeab0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)