To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???お?悠い?日い?鷹い’??健い? 0011111100111111001111111000001010101000001111111001011101001001100000101010001000111111100100111111101010000010101000100011111110010001111010011000001010100010100000010110011000111111001111111000110010010010100000101010001000111111 3f3f3f82a83f974982a23f93fa82a23f91e982a281663f3f8c9282a23f
EUC-JP ???お?悠い?日い?鷹い’??健い? 0011111100111111001111111010010010101010001111111100110110101010101001001010010000111111110001101111110010100100101001000011111111000010111010111010010010100100101000011100011100111111001111111011011111110010101001001010010000111111 3f3f3fa4aa3fcdaaa4a43fc6fca4a43fc2eba4a4a1c73f3fb7f2a4a43f
UTF-8 룵ㄱ캀お룫悠い룫日い룫鷹い’룵ㄲ健い룫 111010111010001110110101111000111000010010110001111011001011101010000000111000111000000110001010111010111010001110101011111001101000001010100000111000111000000110000100111010111010001110101011111001101001011110100101111000111000000110000100111010111010001110101011111010011011011110111001111000111000000110000100111000101000000010011001111010111010001110110101111000111000010010110010111001011000000110100101111000111000000110000100111010111010001110101011 eba3b5e384b1ecba80e3818aeba3abe682a0e38184eba3abe697a5e38184eba3abe9b7b9e38184e28099eba3b5e384b2e581a5e38184eba3ab
UHC 룵ㄱ캀お룫悠い룫日い룫鷹い’룵ㄲ健い룫 1000111110101010101001001010000110101111100011111010101010101010100011111010001011101010111011011010101010100100100011111010001011101100111011011010101010100100100011111010001011101011111011011010101010100100101000011010111110001111101010101010010010100010110010111110110110101010101001001000111110100010 8faaa4a1af8faaaa8fa2eaedaaa48fa2ecedaaa48fa2ebedaaa4a1af8faaa4a2cbedaaa48fa2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)