To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 萸뗫쳡萸덈쳭萸껊쳭gB 1110100010010000101110001110101110010111101010111110110010110011101000011110100010010000101110001110101110001101100010001110110010110011101011011110100010010000101110001110101010111011100010101110110010110011101011010110011101000010 e890b8eb97abecb3a1e890b8eb8d88ecb3ade890b8eabb8aecb3ad6742
SJIS-WIN ???????????????????????????gB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110011101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6742
EUC-JP è?¸ë??ì?¡è?¸ë??ì??è?¸ê??ì??gB 10001111101010111011001000111111100011111010001010110001100011111010101110110011001111110011111110001111101010111100000000111111100011111010001011000010100011111010101110110010001111111000111110100010101100011000111110101011101100110011111100111111100011111010101111000000001111110011111110001111101010111011001000111111100011111010001010110001100011111010101110110100001111110011111110001111101010111100000000111111001111110110011101000010 8fabb23f8fa2b18fabb33f3f8fabc03f8fa2c28fabb23f8fa2b18fabb33f3f8fabc03f3f8fabb23f8fa2b18fabb43f3f8fabc03f3f6742
UTF-8 萸뗫쳡萸덈쳭萸껊쳭gB 1100001110101000110000101001000011000010101110001100001110101011110000101001011111000010101010111100001110101100110000101011001111000010101000011100001110101000110000101001000011000010101110001100001110101011110000101000110111000010100010001100001110101100110000101011001111000010101011011100001110101000110000101001000011000010101110001100001110101010110000101011101111000010100010101100001110101100110000101011001111000010101011010110011101000010 c3a8c290c2b8c3abc297c2abc3acc2b3c2a1c3a8c290c2b8c3abc28dc288c3acc2b3c2adc3a8c290c2b8c3aac2bbc28ac3acc2b3c2ad6742
UHC ??¸????³¡??¸????³­??¸????³­gB 0011111100111111101000101010110000111111001111110011111100111111101010011111100010100010101011100011111100111111101000101010110000111111001111110011111100111111101010011111100010100001101010010011111100111111101000101010110000111111001111110011111100111111101010011111100010100001101010010110011101000010 3f3fa2ac3f3f3f3fa9f8a2ae3f3fa2ac3f3f3f3fa9f8a1a93f3fa2ac3f3f3f3fa9f8a1a96742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)