To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???お?悠い?日い?鷹い’??健い??お 0011111100111111001111111000001010101000001111111001011101001001100000101010001000111111100100111111101010000010101000100011111110010001111010011000001010100010100000010110011000111111001111111000110010010010100000101010001000111111001111111000001010101000 3f3f3f82a83f974982a23f93fa82a23f91e982a281663f3f8c9282a23f3f82a8
EUC-JP ???お?悠い?日い?鷹い’??健い??お 0011111100111111001111111010010010101010001111111100110110101010101001001010010000111111110001101111110010100100101001000011111111000010111010111010010010100100101000011100011100111111001111111011011111110010101001001010010000111111001111111010010010101010 3f3f3fa4aa3fcdaaa4a43fc6fca4a43fc2eba4a4a1c73f3fb7f2a4a43f3fa4aa
UTF-8 룵ㄱ캀お룫悠い룫日い룫鷹い’룵ㄲ健い룫혧お 111010111010001110110101111000111000010010110001111011001011101010000000111000111000000110001010111010111010001110101011111001101000001010100000111000111000000110000100111010111010001110101011111001101001011110100101111000111000000110000100111010111010001110101011111010011011011110111001111000111000000110000100111000101000000010011001111010111010001110110101111000111000010010110010111001011000000110100101111000111000000110000100111010111010001110101011111011011001100010100111111000111000000110001010 eba3b5e384b1ecba80e3818aeba3abe682a0e38184eba3abe697a5e38184eba3abe9b7b9e38184e28099eba3b5e384b2e581a5e38184eba3abed98a7e3818a
UHC 룵ㄱ캀お룫悠い룫日い룫鷹い’룵ㄲ健い룫혧お 100011111010101010100100101000011010111110001111101010101010101010001111101000101110101011101101101010101010010010001111101000101110110011101101101010101010010010001111101000101110101111101101101010101010010010100001101011111000111110101010101001001010001011001011111011011010101010100100100011111010001011000010100011111010101010101010 8faaa4a1af8faaaa8fa2eaedaaa48fa2ecedaaa48fa2ebedaaa4a1af8faaa4a2cbedaaa48fa2c28faaaa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)