To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?揖??音??孃る?援η?柔j?筌 111000011001111110000011100010110011111110010111010010110011111100111111100010011011100100111111001111111001101101101111100000101110100100111111100010011000011110000011110001010011111110001111010111111000001010001010001111111110001010100011 e19f838b3f974b3f3f89b93f3f9b6f82e93f898783c53f8f5f828a3fe2a3
EUC-JP 癲ル?揖??音??孃る?援η?柔j?筌 111000101010000110100101111010110011111111001101101011000011111100111111101100101011101100111111001111111101010111010000101001001110101100111111101100011110011110100110110001110011111110111101110000001010001111101010001111111110010010100101 e2a1a5eb3fcdac3f3fb2bb3f3fd5d0a4eb3fb1e7a6c73fbdc0a3ea3fe4a5
UTF-8 癲ル슣揖밧츦音쎌춷孃る돉援η춯柔j덩筌 1110011110011001101100101110001110000011101010111110110010001010101000111110011010001111100101101110101110110000101001111110110010111000101001101110100110011111101100111110110010001110100011001110110010110110101101111110010110101101100000111110001110000010100010111110101110001111100010011110011010001111101101001100111010110111111011001011011010101111111001101001111110010100111011111011110110001010111010111000110110101001111001111010110110001100 e799b2e383abec8aa3e68f96ebb0a7ecb8a6e99fb3ec8e8cecb6b7e5ad83e3828beb8f89e68fb4ceb7ecb6afe69f94efbd8aeb8da9e7ad8c
UHC 癲ル슣揖밧츦音쎌춷孃る돉援η춯柔j덩筌 1110111110100110101010111110101110011010101011111110101111100111101110011110010110101110100111001110101111100101101111011110110010101101100100111110010110111110101010101110101110001001100110011110101010110101101001011110011110101101100011001110101011110101101000111110101010110101101000101110111110100111 efa6abeb9aafebe7b9e5ae9cebe5bdecad93e5beaaeb8999eab5a5e7ad8ceaf5a3eab5a2efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)