Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?ぁ?い?鷹い?旌?ぁ?い?鷹い?政^ 0011111110000010100111110011111110000010101000100011111110010001111010011000001010100010001111111001110111010101001111111000001010011111001111111000001010100010001111111001000111101001100000101010001000111111100100001010110101011110 3f829f3f82a23f91e982a23f9dd53f829f3f82a23f91e982a23f90ad5e
EUC-JP ?ぁ?い?鷹い?旌?ぁ?い?鷹い?政^ 0011111110100100101000010011111110100100101001000011111111000010111010111010010010100100001111111101101011010111001111111010010010100001001111111010010010100100001111111100001011101011101001001010010000111111110000001010111101011110 3fa4a13fa4a43fc2eba4a43fdad73fa4a13fa4a43fc2eba4a43fc0af5e
UTF-8 룵ぁ캀い룫鷹い룫旌룵ぁ캀い룫鷹い룫政^ 11101011101000111011010111100011100000011000000111101100101110101000000011100011100000011000010011101011101000111010101111101001101101111011100111100011100000011000010011101011101000111010101111100110100101111000110011101011101000111011010111100011100000011000000111101100101110101000000011100011100000011000010011101011101000111010101111101001101101111011100111100011100000011000010011101011101000111010101111100110100101001011111101011110 eba3b5e38181ecba80e38184eba3abe9b7b9e38184eba3abe6978ceba3b5e38181ecba80e38184eba3abe9b7b9e38184eba3abe694bf5e
UHC 룵ぁ캀い룫鷹い룫旌룵ぁ캀い룫鷹い룫政^ 10001111101010101010101010100001101011111000111110101010101001001000111110100010111010111110110110101010101001001000111110100010111011111101101110001111101010101010101010100001101011111000111110101010101001001000111110100010111010111110110110101010101001001000111110100010111011111101100101011110 8faaaaa1af8faaa48fa2ebedaaa48fa2efdb8faaaaa1af8faaa48fa2ebedaaa48fa2efd95e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
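
The conversion shown in the table can be sketched in Python as follows. This is a minimal sketch, not the tool's actual implementation. The function names and the mapping from the table's charset labels to Python codec names are assumptions (in particular `cp932` for SJIS-WIN and `cp949` for UHC). Using `errors="replace"` is also an assumption; it makes Python emit `?` (0x3f) for characters a charset cannot represent, which is what the ISO-8859-1 row appears to show.

```python
# Hypothetical mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumed equivalent of SJIS-Win (CP932)
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed equivalent of UHC
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary, hexadecimal) strings."""
    # Unencodable characters are replaced with "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def encode_table(text):
    """Return {charset label: (binary, hexadecimal)} for every listed charset."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("ぁ^").items():
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("ぁ^", "utf-8")` returns `("11100011100000011000000101011110", "e381815e")`, matching the `e38181` and `5e` byte sequences visible in the UTF-8 row above.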