To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厄μ?誼⑨?音??筌??宜??碎?こ筌??由 100101101110111110000011110010100011111110001011011000101000011101001000001111111000100110111001001111110011111111100010101000110011111100111111100010110101100000111111001111111110000111101010001111111000001010110001111000101010001100111111001111111001011101010010 96ef83ca3f8b6287483f89b93f3fe2a33f3f8b583f3fe1ea3f82b1e2a33f3f9752
EUC-JP 厄μ?誼??音??筌??宜??碎?こ筌??由 1100110011110001101001101100110000111111101101011100001100111111001111111011001010111011001111110011111111100100101001010011111100111111101101011011100100111111001111111110001011101100001111111010010010110011111001001010010100111111001111111100110110110011 ccf1a6cc3fb5c33f3fb2bb3f3fe4a53f3fb5b93f3fe2ec3fa4b3e4a53f3fcdb3
UTF-8 厄μ옓誼⑨쬂音ㅼ졋筌뗰퐤宜쇽쭓碎듬こ筌뗫쵆由 1110010110001110100001001100111010111100111011001001100010010011111010001010101010111100111000101001000110101000111011001010110010000010111010011001111110110011111000111000010110111100111011001010000110001011111001111010110110001100111010111001011110110000111011011001000010100100111001011010111010011100111011001000011110111101111011001010110110010011111001111010001010001110111010111001001110101100111000111000000110010011111001111010110110001100111010111001011110101011111011001011010110000110111001111001010010110001 e58e84cebcec9893e8aabce291a8ecac82e99fb3e385bceca18be7ad8ceb97b0ed90a4e5ae9cec87bdecad93e7a28eeb93ace38193e7ad8ceb97abecb586e794b1
UHC 厄μ옓誼⑨쬂音ㅼ졋筌뗰퐤宜쇽쭓碎듬こ筌뗫쵆由 1110010011111000101001011110110010011110100110011110101111111110101010001110111110100110100110011110101111100101101001001110110010100000101110101110111110100111100010111110111110111101100011011110101111110001101111001110111110100111100010111110000111101111101101011110101110101010101100111110111110100111100010111110101110101100100010001110101110100110 e4f8a5ec9e99ebfea8efa699ebe5a4eca0baefa78befbd8debf1bcefa78be1efb5ebaab3efa78bebac88eba6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)