To what bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????雅?????揄??夭??竊?? 00111111001111110011111100111111001111110011111100111111001111110011111110001001111010110011111100111111001111110011111100111111100111011000100100111111001111111001101011101110001111110011111111100010100001100011111100111111 3f3f3f3f3f3f3f3f3f89eb3f3f3f3f3f9d893f3f9aee3f3fe2863f3f
EUC-JP ?????????雅??沅??揄??夭??竊?? 001111110011111100111111001111110011111100111111001111110011111100111111101100101110110100111111001111111000111111000110111010010011111100111111110110011110100100111111001111111101010011110000001111110011111111100011111001100011111100111111 3f3f3f3f3f3f3f3f3fb2ed3f3f8fc6e93f3fd9e93f3fd4f03f3fe3e63f3f
UTF-8 列룸쑜理덃걗栒삼폇雅뚮떞沅잏뙴揄앹탦夭뽰옓竊뽪틦 111011111010011010011100111010111010001110111000111011001001000110011100111011111010011110100100111010111000110110000011111010101011000110010111111001101010000010010010111011001000001010111100111011011000111110000111111010011001101110000101111010111001101010101110111010111001011010011110111001101011001010000101111011001001111010001111111010111001100110110100111001101000111110000100111011001001010110111001111011011000001110100110111001011010010010101101111010111011110110110000111011001001100010010011111001111010101110001010111010111011110110101010111011011000101110100110 efa69ceba3b8ec919cefa7a4eb8d83eab197e6a092ec82bced8f87e99b85eb9aaeeb969ee6b285ec9e8feb99b4e68f84ec95b9ed83a6e5a4adebbdb0ec9893e7ab8aebbdaaed8ba6
UHC 列룸쑜理덃걗栒삼폇雅뚮떞沅잏뙴揄앹탦夭뽰옓竊뽪틦 111001101110101010110111111010111001110010111011111011001011010110001000111001101000000110000010111000101110001110111011111011111011110010010100111001001011101010001100111010111000101110110100111010101011011010011111111001111000110010110111111010101111000110011101111011001011010110001000111010001110110010010110111011001001111010011001111011111011110010010110111001101011101010010000 e6eab7eb9cbbecb588e68182e2e3bbefbc94e4ba8ceb8bb4eab69fe78cb7eaf19decb588e8ec96ec9e99efbc96e6ba90
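
A table like the one above can be reproduced offline with a short script. The following is a minimal sketch, not the tool's actual implementation. It assumes Python's built-in codecs `cp932`, `euc_jp` and `cp949` correspond to the SJIS-WIN, EUC-JP and UHC rows, and that characters a charset cannot represent are replaced with `?` (0x3f), as the ISO-8859-1 row suggests. The helper names `encode_row` and `convert` are made up for illustration.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes '?' (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in convert("雅").items():
        print(label, bits, hexa)
```

For the single character 雅, this yields `89eb` for SJIS-WIN, `b2ed` for EUC-JP, `e99b85` for UTF-8 and `3f` for ISO-8859-1, the same byte sequences that appear for that character in the rows above.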

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.