To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????Lh?????????L 001111110011111100111111001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f4c683f3f3f3f3f3f3f3f3f4c
SJIS-WIN 畑??飮?Ⅴ揄??Lh畑??飮?Ⅴ揄??L 1001010010101000001111110011111110011111010110100011111110000111010110001001110110001001001111110011111101001100011010001001010010101000001111110011111110011111010110100011111110000111010110001001110110001001001111110011111101001100 94a83f3f9f5a3f87589d893f3f4c6894a83f3f9f5a3f87589d893f3f4c
EUC-JP 畑??飮??揄??Lh畑??飮??揄??L 110010001010101000111111001111111101110110111011001111110011111111011001111010010011111100111111010011000110100011001000101010100011111100111111110111011011101100111111001111111101100111101001001111110011111101001100 c8aa3f3fddbb3f3fd9e93f3f4c68c8aa3f3fddbb3f3fd9e93f3f4c
UTF-8 畑대벝飮깍Ⅴ揄명뫑Lh畑대벝飮깍Ⅴ揄명뫑L 111001111001010110010001111010111000110010000000111010111011001010011101111010011010001110101110111010101011100110001101111000101000010110100100111001101000111110000100111010111010101010000101111010111010101110010001010011000110100011100111100101011001000111101011100011001000000011101011101100101001110111101001101000111010111011101010101110011000110111100010100001011010010011100110100011111000010011101011101010101000010111101011101010111001000101001100 e79591eb8c80ebb29de9a3aeeab98de285a4e68f84ebaa85ebab914c68e79591eb8c80ebb29de9a3aeeab98de285a4e68f84ebaa85ebab914c
UHC 畑대벝飮깍Ⅴ揄명뫑Lh畑대벝飮깍Ⅴ揄명뫑L 111011111010010110110100111010111001001110111000111010111110011010110001111011111010010110110100111010101111000110111000111011011001000110110011010011000110100011101111101001011011010011101011100100111011100011101011111001101011000111101111101001011011010011101010111100011011100011101101100100011011001101001100 efa5b4eb93b8ebe6b1efa5b4eaf1b8ed91b34c68efa5b4eb93b8ebe6b1efa5b4eaf1b8ed91b34c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)