To which bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???癲??淫?????陰??陰?????B 0011111100111111001111111110000110011111001111110011111110001000111110100011111100111111001111110011111100111111100010010100000100111111001111111000100101000001001111110011111100111111001111110011111101000010 3f3f3fe19f3f3f88fa3f3f3f3f3f89413f3f89413f3f3f3f3f42
EUC-JP ???癲??淫?????陰??陰?????B 0011111100111111001111111110001010100001001111110011111110110000111111000011111100111111001111110011111100111111101100011010001000111111001111111011000110100010001111110011111100111111001111110011111101000010 3f3f3fe2a13f3fb0fc3f3f3f3f3fb1a23f3fb1a23f3f3f3f3f42
UTF-8 溜깅젡癲좎꽌淫먮젿溜싧뒔陰쎌꺃陰쎈젿溜뽰뀒B 11101111101001111000101111101010101110011000010111101100101000001010000111100111100110011011001011101100101000101000111011101010101111011000110011100110101101111010101111101011101010001010111011101100101000001011111111101111101001111000101111101100100010111010011111101011100100101001010011101001100110011011000011101100100011101000110011101010101110101000001111101001100110011011000011101100100011101000100011101100101000001011111111101111101001111000101111101011101111011011000011101011100000001001001001000010 efa78beab985eca0a1e799b2eca28eeabd8ce6b7abeba8aeeca0bfefa78bec8ba7eb9294e999b0ec8e8ceaba83e999b0ec8e88eca0bfefa78bebbdb0eb809242
UHC 溜깅젡癲좎꽌淫먮젿溜싧뒔陰쎌꺃陰쎈젿溜뽰뀒B 11101010111111101011000111101011101000001001101011101111101001101010000011101100100001001001110011101011111000101001000011101011101000001011000111101010111111101001101011100101100010101001000111101011111001001011110111101100100000111010110011101011111001001011110111101011101000001011000111101010111111101001011011101100100001011000110001000010 eafeb1eba09aefa6a0ec849cebe290eba0b1eafe9ae58a91ebe4bdec83acebe4bdeba0b1eafe96ec858c42
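
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own code. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC), the helper name `encode_table`, and the use of `errors="replace"` to turn unencodable characters into `?` (0x3f) are all assumptions chosen to resemble the output above.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win corresponds to CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to CP949
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Characters the charset cannot represent become "?" (0x3f).
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexed in encode_table("B"):
        print(label, bits, hexed)
```

For the input `B`, every charset listed yields the single byte `42` (binary `01000010`), which matches the final byte of each row in the table.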

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.