To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???鍮??蹂??閻??逾??遺??鈺 00111111001111110011111111101000010010100011111100111111111001101111100000111111001111111110100010000101001111110011111111100111101001010011111100111111100010001110001000111111001111111111101111000100 3f3f3fe84a3f3fe6f83f3fe8853f3fe7a53f3f88e23f3ffbc4
EUC-JP ???鍮??蹂??閻??逾??遺??鈺 0011111100111111001111111110111110101011001111110011111111101100111110100011111100111111111011111110010100111111001111111110111010100111001111110011111110110000111001000011111100111111100011111110001111010101 3f3f3fefab3f3fecfa3f3fefe53f3feea73f3fb0e43f3f8fe3d5
UTF-8 略노쵐鍮볞슭蹂㏂걶閻롡뱿逾뷴쫩遺듦틕鈺 111011111010010110110110111010111000010110111000111011001011010110010000111010011000110110101110111010111011001110011110111011001000101010101101111010001011100110000010111000111000111110000010111010101011000110110110111010011001011010111011111010111010000110100001111010111011000110111111111010011000000010111110111010111011011110110100111011001010101110101001111010011000000110111010111010111001001110100110111011011000101110010101111010011000100010111010 efa5b6eb85b8ecb590e98daeebb39eec8aade8b982e38f82eab1b6e996bbeba1a1ebb1bfe980beebb7b4ecaba9e981baeb93a6ed8b95e988ba
UHC 略노쵐鍮볞슭蹂㏂걶閻롡뱿逾뷴쫩遺듦틕鈺 1110010110110010101100111110101110101100100100101110101110111001100100111110010010111101101111101110101110110011101000101110001110000001100111001110011110100010100011101110001010010011101001011110101110110101101110101110010110100110100000101110101110110110101101011110101010111010100000111110100010101101 e5b2b3ebac92ebb993e4bdbeebb3a2e3819ce7a28ee293a5ebb5bae5a682ebb6b5eaba83e8ad

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
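
The conversion shown in the table can be approximated offline with a few lines of Python. This is only a sketch, not the page's own implementation, which is not shown here. The helper names `encode_row` and `encode_table` are hypothetical, and so is the mapping from the table's charset labels to Python codec names (in particular, SJIS-WIN is assumed to be `cp932` and UHC is assumed to be `cp949`). Characters that a charset cannot represent are replaced with `?` (byte 3f), which appears to be what the table does as well.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = Windows code page 949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

For example, `encode_row("あ", "utf-8")` returns the hexadecimal string `e38182`, while `encode_row("あ", "iso-8859-1")` returns `3f`, because ISO-8859-1 has no code for that character.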