To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蘂??意??儒??筌??循?┃???鈺 1110010101000001001111110011111110001000110100110011111100111111100011101111001000111111001111111110001010100011001111110011111110001111011110100011111110000100101010110011111100111111001111111111101111000100 e5413f3f88d33f3f8ef23f3fe2a33f3f8f7a3f84ab3f3f3ffbc4
EUC-JP 蘂??意??儒??筌??循?┃洹??鈺 1110100110100010001111110011111110110000110101010011111100111111101111001111010000111111001111111110010010100101001111110011111110111101110110110011111110101000101011011000111111000111101110100011111100111111100011111110001111010101 e9a23f3fb0d53f3fbcf43f3fe4a53f3fbddb3fa8ad8fc7ba3f3f8fe3d5
UTF-8 蘂띠눖意덄뙴儒룹묾筌믩끃循녜┃洹숆틕鈺 111010001001100010000010111010111001110110100000111010111000100010010110111001101000010010001111111010111000110110000100111010111001100110110100111001011000010010010010111010111010001110111001111010111010110010111110111001111010110110001100111010111010111110101001111010111000000110000011111001011011111010101010111010111000010110011100111000101001010010000011111001101011010010111001111011001000100010000110111011011000101110010101111010011000100010111010 e89882eb9da0eb8896e6848feb8d84eb99b4e58492eba3b9ebacbee7ad8cebafa9eb8183e5beaaeb859ce29483e6b4b9ec8886ed8b95e988ba
UHC 蘂띠눖意덄뙴儒룹묾筌믩끃循녜┃洹숆틕鈺 1110011111011110101101101110110010000111101100001110101111110010100010001110011110001100101101111110101011100011101101111110110010111001101100101110111110100111100100101110101110000101101110011110001011100000101100111110100110100110101011011110101010110111100110011110101010111010100000111110100010101101 e7deb6ec87b0ebf288e78cb7eae3b7ecb9b2efa792eb85b9e2e0b3e9a6adeab799eaba83e8ad

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
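
The conversion in the table above can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation: the function name `encode_table` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return (label, binary string, hex string) for text in each charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexa in encode_table("あ"):
        print(label, bits, hexa)
```

For the input `あ`, the UTF-8 row of this sketch comes out as `e38182` in hexadecimal. Results for the other charsets may differ slightly from this page's output where the codec tables are not identical.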