What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN ?孰?煙?荀巍?}v?孰?煙?荀巍?}vB 0011111110011011011110000011111110001001100011000011111111100100101001001001101111011001001111110111110101110110001111111001101101111000001111111000100110001100001111111110010010100100100110111101100100111111011111010111011001000010 3f9b783f898c3fe4a49bd93f7d763f9b783f898c3fe4a49bd93f7d7642
EUC-JP ?孰?煙?荀巍?}v?孰?煙?荀巍?}vB 0011111111010101110110010011111110110001111011000011111111101000101001101101011011011011001111110111110101110110001111111101010111011001001111111011000111101100001111111110100010100110110101101101101100111111011111010111011001000010 3fd5d93fb1ec3fe8a6d6db3f7d763fd5d93fb1ec3fe8a6d6db3f7d7642
UTF-8 롒孰렲煙롒荀巍렳}v롒孰렲煙롒荀巍렳}vB 1110101110100001100100101110010110101101101100001110101110100000101100101110011110000101100110011110101110100001100100101110100010001101100000001110010110110111100011011110101110100000101100110111110101110110111010111010000110010010111001011010110110110000111010111010000010110010111001111000010110011001111010111010000110010010111010001000110110000000111001011011011110001101111010111010000010110011011111010111011001000010 eba192e5adb0eba0b2e78599eba192e88d80e5b78deba0b37d76eba192e5adb0eba0b2e78599eba192e88d80e5b78deba0b37d7642
UHC 롒孰렲煙롒荀巍렳}v롒孰렲煙롒荀巍렳}vB 10001110110101111110001011010101100011101011111111100110110101011000111011010111111000101111000011101000111001001000111011000000011111010111011010001110110101111110001011010101100011101011111111100110110101011000111011010111111000101111000011101000111001001000111011000000011111010111011001000010 8ed7e2d58ebfe6d58ed7e2f0e8e48ec07d768ed7e2d58ebfe6d58ed7e2f0e8e48ec07d7642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
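
The conversion in the table above can be approximated with a short Python sketch. This is not the tool's own implementation. The codec names are assumptions: Python's "cp932" stands in for SJIS-WIN and "cp949" for UHC, and the helper name encode_table is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which resembles the 3f bytes seen in the table.

```python
# Map the table's charset labels to Python codec names (assumed equivalents).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


for label, bits, hex_string in encode_table("}vB"):
    print(label, bits, hex_string)
```

ASCII characters such as "}vB" encode to the same bytes (7d 76 42) in all five charsets, which is why every row in the table ends with the same hexadecimal digits.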