To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 哀??肉ヨぜ恂??????嚥???筌??陰 1000100010100011001111110011111110010011111101111000001110001000100000101011101010011100100101100011111100111111001111110011111100111111001111111001101010001011001111110011111100111111111000101010001100111111001111111000100101000001 88a33f3f93f7838882ba9c963f3f3f3f3f3f9a8b3f3f3fe2a33f3f8941
EUC-JP 哀??肉ヨぜ恂??孼???嚥???筌??陰 10110000101001010011111100111111110001101111100110100101111010001010010010111100110101111111011000111111001111111000111110111010110000110011111100111111001111111101001111101011001111110011111100111111111001001010010100111111001111111011000110100010 b0a53f3fc6f9a5e8a4bcd7f63f3f8fbac33f3f3fd3eb3f3f3fe4a53f3fb1a2
UTF-8 哀노맧肉ヨぜ恂㏃쑐孼꾊딆죳嚥싲갭흟筌욊퇍陰 111001011001001110000000111010111000010110111000111010111010011110100111111010001000001010001001111000111000001110101000111000111000000110011100111001101000000110000010111000111000111110000011111011001001000110010000111001011010110110111100111010101011111010001010111010111001010010000110111011001010001110110011111001011001101010100101111011001000101110110010111010101011000010101101111011011001110110011111111001111010110110001100111011001001101010001010111011011000011110001101111010011001100110110000 e59380eb85b8eba7a7e88289e383a8e3819ce68182e38f83ec9190e5adbceabe8aeb9486eca3b3e59aa5ec8bb2eab0aded9d9fe7ad8cec9a8aed878de999b0
UHC 哀노맧肉ヨぜ恂㏃쑐孼꾊딆죳嚥싲갭흟筌욊퇍陰 111001001110111010110011111010111001000010110000111010111011111110101011111010001010101010111100111000101110000110100111111011001001110010101111111001011110110110000100110100011000101011101100101000011000111011100110101111111001101011101011101100001011100011000101100000011110111110100111100111101110101010110111100111101110101111100100 e4eeb3eb90b0ebbfabe8aabce2e1a7ec9cafe5ed84d18aeca18ee6bf9aebb0b8c581efa79eeab79eebe4

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
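
The conversion in the table above can be approximated with a short Python sketch. This is not the code behind this page; it is a minimal illustration that assumes Python's built-in codecs `iso-8859-1`, `cp932`, `euc_jp`, `utf-8` and `cp949` correspond to the charsets listed, and that characters a charset cannot represent are replaced with "?" (0x3f), as the 3f bytes in the table suggest. The names `CODECS` and `to_bit_strings` are hypothetical.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec
    # cannot represent, instead of raising UnicodeEncodeError.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("哀", codec)
        print(label, binary, hexadecimal)
```

For the single character 哀, this sketch yields e59380 in UTF-8, 88a3 in CP932 and b0a5 in EUC-JP, which matches the leading bytes of the corresponding rows in the table.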