What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 訝?ぜ節??厭??帳??鸚??厭??蘊 111001100110001000111111100000101011101010010000110111110011111100111111100010010111110100111111001111111001001010100000001111110011111111101010010111110011111100111111100010010111110100111111001111111110010101011101 e6623f82ba90df3f3f897d3f3f92a03f3fea5f3f3f897d3f3fe55d
EUC-JP 訝?ぜ節??厭??帳??鸚??厭??蘊 111010111100001100111111101001001011110011000000111000010011111100111111101100011101111000111111001111111100010010100010001111110011111111110011110000000011111100111111101100011101111000111111001111111110100110111110 ebc33fa4bcc0e13f3fb1de3f3fc4a23f3ff3c03f3fb1de3f3fe9be
UTF-8 訝딃ぜ節몇뀳厭얕퍓帳⑶뮈鸚까뀳厭얕쵛蘊 111010001010100010011101111010111001010010000011111000111000000110011100111001111010111110000000111010111010101010000111111010111000000010110011111001011000111010101101111011001001011010010101111011011000110110010011111001011011100010110011111000101001000110110110111010111010111010001000111010011011100010011010111010101011100110001100111010111000000010110011111001011000111010101101111011001001011010010101111011001011010110011011111010001001100010001010 e8a89deb9483e3819ce7af80ebaa87eb80b3e58eadec9695ed8d93e5b8b3e291b6ebae88e9b89aeab98ceb80b3e58eadec9695ecb59be8988a
UHC 訝딃ぜ節몇뀳厭얕퍓帳⑶뮈鸚까뀳厭얕쵛蘊 1110010010111000100010101110100110101010101111001110111110111101101110001110111010000101101010011110011011110100101111101110100010111011100010101110110111100011101010011110100110111001101111111110010110100100101100011110111010000101101010011110011011110100101111101110100010101100100111011110100010110011 e4b88ae9aabcefbdb8ee85a9e6f4bee8bb8aede3a9e9b9bfe5a4b1ee85a9e6f4bee8ac9de8b3

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
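
The conversion shown in the table can be reproduced roughly as follows. This is only a minimal sketch in Python, not the code behind this page. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the function name encode_table is hypothetical. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the 3f bytes in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, then concatenate.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("あ"):
        print(label, bits, hexstr)
```

Note that the input must already be decoded correctly before it is encoded. If bytes in one charset are read as if they were in another, the Character column shows garbled text and the resulting bit strings describe the garbled characters rather than the intended ones.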