To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 晶」|J}晶」|J{^ 11100110100110011011011011101110100011001011110111101111101111011010001101111100010010100111110111100110100110011011011011101110100011001011110111101111101111011010001101111100010010100111101101011110 e699b6ee8cbdefbda37c4a7de699b6ee8cbdefbda37c4a7b5e
SJIS-WIN ??¶?????£|J}??¶?????£|J{^ 0011111100111111100000011111011100111111001111110011111100111111001111111000000110010010011111000100101001111101001111110011111110000001111101110011111100111111001111110011111100111111100000011001001001111100010010100111101101011110 3f3f81f73f3f3f3f3f81927c4a7d3f3f81f73f3f3f3f3f81927c4a7b5e
EUC-JP æ?¶î??ï?£|J}æ?¶î??ï?£|J{^ 1000111110101001110000010011111110100010111110011000111110101011110000100011111100111111100011111010101111000001001111111010000111110010011111000100101001111101100011111010100111000001001111111010001011111001100011111010101111000010001111110011111110001111101010111100000100111111101000011111001001111100010010100111101101011110 8fa9c13fa2f98fabc23f3f8fabc13fa1f27c4a7d8fa9c13fa2f98fabc23f3f8fabc13fa1f27c4a7b5e
UTF-8 晶」|J}晶」|J{^ 11000011101001101100001010011001110000101011011011000011101011101100001010001100110000101011110111000011101011111100001010111101110000101010001101111100010010100111110111000011101001101100001010011001110000101011011011000011101011101100001010001100110000101011110111000011101011111100001010111101110000101010001101111100010010100111101101011110 c3a6c299c2b6c3aec28cc2bdc3afc2bdc2a37c4a7dc3a6c299c2b6c3aec28cc2bdc3afc2bdc2a37c4a7b5e
UHC æ?¶??½?½?|J}æ?¶??½?½?|J{^ 101010011010000100111111101000101101001000111111001111111010100011110110001111111010100011110110001111110111110001001010011111011010100110100001001111111010001011010010001111110011111110101000111101100011111110101000111101100011111101111100010010100111101101011110 a9a13fa2d23f3fa8f63fa8f63f7c4a7da9a13fa2d23f3fa8f63fa8f63f7c4a7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)