To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????h????????? 00111111001111110011111100111111001111110011111100111111001111110011111101101000001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??泣??筍ゃ?h癲??泣??筍ゃ? 111000011001111100111111001111111000101110000011001111110011111111100010101000011000001011100001001111110110100011100001100111110011111100111111100010111000001100111111001111111110001010100001100000101110000100111111 e19f3f3f8b833f3fe2a182e13f68e19f3f3f8b833f3fe2a182e13f
EUC-JP 癲??泣??筍ゃ?h癲??泣??筍ゃ? 111000101010000100111111001111111011010111100011001111110011111111100100101000111010010011100011001111110110100011100010101000010011111100111111101101011110001100111111001111111110010010100011101001001110001100111111 e2a13f3fb5e33f3fe4a3a4e33f68e2a13f3fb5e33f3fe4a3a4e33f
UTF-8 癲ⓥ뫖泣숃굢筍ゃ럦h癲ⓥ뫖泣숃굢筍ゃ럦 11100111100110011011001011100010100100111010010111101011101010111001011011100110101100111010001111101100100010001000001111101010101101011010001011100111101011011000110111100011100000101000001111101011100111111010011001101000111001111001100110110010111000101001001110100101111010111010101110010110111001101011001110100011111011001000100010000011111010101011010110100010111001111010110110001101111000111000001010000011111010111001111110100110 e799b2e293a5ebab96e6b3a3ec8883eab5a2e7ad8de38283eb9fa668e799b2e293a5ebab96e6b3a3ec8883eab5a2e7ad8de38283eb9fa6
UHC 癲ⓥ뫖泣숃굢筍ゃ럦h癲ⓥ뫖泣숃굢筍ゃ럦 11101111101001101010100011100010100100011011100011101011111010001001100111101000100000101000100111100010111011001010101011100011100011101000100101101000111011111010011010101000111000101001000110111000111010111110100010011001111010001000001010001001111000101110110010101010111000111000111010001001 efa6a8e291b8ebe899e88289e2ecaae38e8968efa6a8e291b8ebe899e88289e2ecaae38e89

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
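
The same kind of table can be reproduced offline. The following is a minimal Python sketch, not the code behind this page: the function names `encode_row` and `encode_table` are hypothetical, and the mapping from the charset labels above to Python codec names (for example UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # "\u3042" is HIRAGANA LETTER A, followed by the ASCII letter "h".
    for label, (bits, hexa) in encode_table("\u3042h").items():
        print(label, bits, hexa)
```

Each byte is rendered as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.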