To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?ぁ?お??い??ぁ?い?悠い??お??い? 00111111100000101001111100111111100000101010100000111111001111111000001010100010001111110011111110000010100111110011111110000010101000100011111110010111010010011000001010100010001111110011111110000010101010000011111100111111100000101010001000111111 3f829f3f82a83f3f82a23f3f829f3f82a23f974982a23f3f82a83f3f82a23f
EUC-JP ?ぁ?お??い??ぁ?い?悠い??お??い? 00111111101001001010000100111111101001001010101000111111001111111010010010100100001111110011111110100100101000010011111110100100101001000011111111001101101010101010010010100100001111110011111110100100101010100011111100111111101001001010010000111111 3fa4a13fa4aa3f3fa4a43f3fa4a13fa4a43fcdaaa4a43f3fa4aa3f3fa4a43f
UTF-8 룵ぁ캀お룫횕い▩룵ぁ캀い룫悠い룫킃お룫횕い룫 111010111010001110110101111000111000000110000001111011001011101010000000111000111000000110001010111010111010001110101011111011011001101010010101111000111000000110000100111000101001011010101001111010111010001110110101111000111000000110000001111011001011101010000000111000111000000110000100111010111010001110101011111001101000001010100000111000111000000110000100111010111010001110101011111011011000001010000011111000111000000110001010111010111010001110101011111011011001101010010101111000111000000110000100111010111010001110101011 eba3b5e38181ecba80e3818aeba3abed9a95e38184e296a9eba3b5e38181ecba80e38184eba3abe682a0e38184eba3abed8283e3818aeba3abed9a95e38184eba3ab
UHC 룵ぁ캀お룫횕い▩룵ぁ캀い룫悠い룫킃お룫횕い룫 1000111110101010101010101010000110101111100011111010101010101010100011111010001011000011100011111010101010100100101000101100110010001111101010101010101010100001101011111000111110101010101001001000111110100010111010101110110110101010101001001000111110100010101101001000111110101010101010101000111110100010110000111000111110101010101001001000111110100010 8faaaaa1af8faaaa8fa2c38faaa4a2cc8faaaaa1af8faaa48fa2eaedaaa48fa2b48faaaa8fa2c38faaa48fa2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)