Into what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???肄??飮??沃??悠??矣??壤??B 00111111001111110011111111100011111001010011111100111111100111110101101000111111001111111001011110000000001111110011111110010111010010010011111100111111111000011110000100111111001111111001101011011111001111110011111101000010 3f3f3fe3e53f3f9f5a3f3f97803f3f97493f3fe1e13f3f9adf3f3f42
EUC-JP ???肄??飮??沃??悠??矣??壤??B 00111111001111110011111111100110111001110011111100111111110111011011101100111111001111111100110111100000001111110011111111001101101010100011111100111111111000101110001100111111001111111101010011100001001111110011111101000010 3f3f3fe6e73f3fddbb3f3fcde03f3fcdaa3f3fe2e33f3fd4e13f3f42
UTF-8 嶺뚯쉶肄끾끽飮뗣룋沃섅꺃悠띄춯矣롫닔壤쏆눙B 11101111101001101010101111101011100110101010111111101100100010011011011011101000100000101000010011101011100000011011111011101011100000011011110111101001101000111010111011101011100101111010001111101011101000111000101111100110101100101000001111101100100001001000010111101010101110101000001111100110100000101010000011101011100111011000010011101100101101101010111111100111100111111010001111101011101000011010101111101011100010111001010011100101101000111010010011101100100011111000011011101011100010001001100101000010 efa6abeb9aafec89b6e88284eb81beeb81bde9a3aeeb97a3eba38be6b283ec8485eaba83e682a0eb9d84ecb6afe79fa3eba1abeb8b94e5a3a4ec8f86eb889942
UHC 嶺뚯쉶肄끾끽飮뗣룋沃섅꺃悠띄춯矣롫닔壤쏆눙B 11100111101011011000110011101100100110101000110011101100101111011000010111100110101100111010001111101011111001101000101111100011100011111000101011101000101010101001100011100011100000111010110011101010111011011011011011100111101011011000110011101011111110001000111011101011100010001001100011100101101111011001101111101100101101001011000101000010 e7ad8cec9a8cecbd85e6b3a3ebe68be38f8ae8aa98e383aceaedb6e7ad8cebf88eeb8898e5bd9becb4b142

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
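
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The mapping from the charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption. So is errors="replace", chosen because it substitutes "?" (hex 3f) for characters a charset cannot represent, which resembles the 3f bytes in the rows above. The function name encode_in_charsets is hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" (0x3f) instead of raising an error.
        data = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, concatenated without separators.
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_in_charsets("B"):
        print(label, binary, hexadecimal)
```

For the input "B", every charset yields 01000010 (hex 42), which agrees with the final byte of each row in the table.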