What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 哀??肉ヨぜ恂??????癲??魏??夷 10001000101000110011111100111111100100111111011110000011100010001000001010111010100111001001011000111111001111110011111100111111001111110011111111100001100111110011111100111111111010011011000000111111001111111000100011001110 88a33f3f93f7838882ba9c963f3f3f3f3f3fe19f3f3fe9b03f3f88ce
EUC-JP 哀??肉ヨぜ恂??孼???癲??魏??夷 101100001010010100111111001111111100011011111001101001011110100010100100101111001101011111110110001111110011111110001111101110101100001100111111001111110011111111100010101000010011111100111111111100101011001000111111001111111011000011010000 b0a53f3fc6f9a5e8a4bcd7f63f3f8fbac33f3f3fe2a13f3ff2b23f3fb0d0
UTF-8 哀노맧肉ヨぜ恂㏃쑐孼꾊딆죳癲놁옓魏꾣껸夷 111001011001001110000000111010111000010110111000111010111010011110100111111010001000001010001001111000111000001110101000111000111000000110011100111001101000000110000010111000111000111110000011111011001001000110010000111001011010110110111100111010101011111010001010111010111001010010000110111011001010001110110011111001111001100110110010111010111000011010000001111011001001100010010011111010011010110110001111111010101011111010100011111010101011101110111000111001011010010010110111 e59380eb85b8eba7a7e88289e383a8e3819ce68182e38f83ec9190e5adbceabe8aeb9486eca3b3e799b2eb8681ec9893e9ad8feabea3eabbb8e5a4b7
UHC 哀노맧肉ヨぜ恂㏃쑐孼꾊딆죳癲놁옓魏꾣껸夷 11100100111011101011001111101011100100001011000011101011101111111010101111101000101010101011110011100010111000011010011111101100100111001010111111100101111011011000010011010001100010101110110010100001100011101110111110100110100001101110110010011110100110011110101011100000100001001110011010110010101110011110110010101000 e4eeb3eb90b0ebbfabe8aabce2e1a7ec9cafe5ed84d18aeca18eefa686ec9e99eae084e6b2b9eca8

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
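
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The helper name `encode_row` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration, and Python's codecs may differ from the page's converter for some characters. Characters that a charset cannot represent are replaced with `?` (byte 3f), which is how the unmappable characters appear in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (round-tripped text, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return data.decode(codec), bits, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        shown, bits, hexed = encode_row("哀", codec)
        print(label, shown, bits, hexed)
```

For the single character 哀, this yields `88a3` for SJIS-WIN, `b0a5` for EUC-JP, and `e59380` for UTF-8, matching the first bytes of the corresponding rows above.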